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SUMMARY 


For second-order elliptic boundary value problems, we develop a nonconforming multigrid 
method using the coarser-grid correction on the conforming finite element subspaces Th 
Tnvergence proof with an arbitrary number of smoothing steps for V-cycle is presented. 


1. INTRODUCTION 


Let ft be a convex 
exists ao such that a 
multigrid method for 


polygon in R ! . Let / £ L\Sl), a € C'( ft) nnd f £ C°(ft). We assume there 
> a 0 > 0 and j3 > 0. In this paper we discuss convergence properties ot the 

solving the Dirichlet problem 

-V • (aVu) + fiu = / in ft , 
u = 0 on 5ft , 


( 1 ) 

( 2 ) 


using PI nonconforming finite elements(see [5, 6]). 

The prototype of the multigrid convergence theory is that 

For some number of smoothing steps the multigrid process is a contraction for some 
norm. Moreover, the contraction number is independent of the mesh size h. 


This was proved for conforming multigrid methods by Bank and Dupont[l]. Braess and 
Hackbusch[2l and Hackbusch[8] proved this for the V cycle with one smoothing step. For the 
nonconforming multigrid method, this was proved by Braess and Verfurth[3] and Brenner [ ] or 
W-cycle under the condition that each iteration step contains many smoothing steps. 

The method presented in this paper consists of a smoothing step on the nonconforming finite 
elemenUpace of the finest-grid and correction step which is obtained by the conforming multigrid 

•This research was partially support by the National Science Foundation under Grant No. CDA- 
9024618 and DMS-9203502. 
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method on the conforming finite element subspaces of coarser-grids. The standard nonconforming 
multigrid which was proved by Brenner in [4] is based on smoothings and correction on the 
nonconforming finite element spaces. The important difference is that V k -\ V k and C V k , 
where V k and W k are the nonconforming and conforming finite element spaces on mesh level k, 
respectively. Hence we can simply use the natural injection for the intergrid transfer of grid 
functions and this intergrid transfer operator preserves the energy norm. Moreover, the error of the 
coarser-grid correction is orthogonal to W k . h Owing to these, the standard proof of convergence in 
[2] for the V-cycle of one smoothing step of the conforming multigrid method carries over directly. 
In [3] Braess and Verfurth added the step length parameter in the correction step of the standard 
nonconforming multigrid algorithm to improve the convergence. They proved the convergence of 
two-level case of this modified standard nonconforming multigrid with one smoothing step. The rate 
of convergence of their algorithm should be better than or at least equal to that of the standard 
nonconforming multigrid method but it needs more cost for each iteration. While Brenner proved 
the convergence of the standard nonconforming multigrid algorithm only for the W-cycle it is 
convergent for the V cycle with one smoothing step in real computation. Also the modified 
standard nonconforming multigrid algorithm converges for the V cycle with one smoothing step in 
real computation. Our multigrid method is easier to implement and more effective because it needs 
fewer computations and communications in a parallel sense. These computations were done in 
CM-5 Vector Units*. 

This paper is organized as follows. In Section 2 we discuss the fundamental estimates from the 
theory of finite elements and the intergrid transfer operator. The multigrid algorithm is discussed 
m Section 3. Section 4 contains the contracting properties of the Jfc-level iteration. In the last 
section we compare the computational results of three algorithms. 


2. THE FINITE ELEMENT SPACES 


The variational formulation for (1) and (2) is defined as follows: Find u 6 such that 

a(u,v) = F(v) Vv e Hq(CI) , 

where 

a(u,v)= I (aVu • Vu + (duv) and F(v) = [ fv. 

Jn 

Here, #(>(0) denotes the usual Sobolev space (see [5]). 

Let {T }, k > 1 , be a family of triangulations of S7, where T * +1 is obtained by connecting the 
midpoints of the edges of the triangles in T k . Let h k := max r€T * diamT, then h k = 2h k+1 . 
Throughout this paper, C denotes the positive constant independent of k which may vary from 
occurence to occurence even in the proof of the same theorem. 

t These results are based upon a test version of the software where the emphasis was on providing functionality 
and the tools necessary to begin testing the CM5 with vector units. This software release has not had the benefit of 

optimization or performance tuning and, consequently, is not necessarily representative of the performance of the full 
version of this software. 
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It is worth pointing out the motivation of the nonconforming finite elements. In the stationary 
Stokes problem for an incompressible viscous fluid, it is realized that a major difficulty exists in the 
numerical treatment of the incompressibility condition. Crouzeix and Raviart in [6] advocated the 
method that the incompressibility condition is approximated. They have found it very convenient 
to use nonconforming finite elements for this purpose. By Uzawa’s method the Stokes equation is 
reduced to a sequence of Dirichlet problems for the operator —A. Thus we shall develop a 
nonconforming multigrid method for solving (1) and (2). 

Now let’s define the nonconforming finite element space 

V k := {u : v|j is linear for all T G T k , v is continuous at the midpoints 
of the edges and v = 0 at the mid points on . 


Note that functions in V k are not continuous. 

We also use a conforming finite element space for our multigrid method NC-CMG. Define 

W k := {w : w\j is linear for all T 6 T k , w is continuous 

on fl and w Ian = 0} • 

The space V k will be used in the finest-grid space and W k in the coarser-grid spaces to obtain 
NC-CMG. Observe that W k = V k f] Hq(Q) = V k D V k+l . 

For each k, define (on V k + H\ (fl)) 

a k (u,v):= ^2 / • Vv + /?ui>) 

xer* Jr 


and the energy norm induced by a k 

IMIfc := y/a k (u,u). 

The bilinear form a fc (-, •) is symmetric and positive definite on V k . Moreover, we have the inverse 
estimated] , . 

Vu€V fc . (3) 

We also note that if it, u G #o(ft), then a k (u,v ) = a(u,v). 

We now recall some fundamental estimates from the theory of finite elements. 

Since / G L 2 ( fl), elliptic regularity implies that u G if 2 (fi)(see [7]). For the same /, let u k G V* 
satisfy 

a k (u k , v)= f fv Vu G V k 

and let u k G W k satisfy 

a k (u k ,v)= fv VvGW*. 

J Q 

Since V k satisfies the patch test (see [11]), we have the following estimate for the discretization error: 

||u - u k \\ L 2 + h k \\u - u fc ||fc < CfcJlMljf* ( 4 ) 
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(see [6]). The estimate for the conforming descretization error is, of course, well known(see [5]): 


u Wfc l 2 4" hk U — U k \\ k ^ C/lj[||u H 3 • 


( 5 ) 


From the spectral theory, there exist eigenvalues 0 < Ai < A 2 < • • • < A„ k and eigenfunctions 
tfo, tfo, • ■ ■ , Vv € V*, (V»i, ^i)l 3 = Sa (= the Kronecker delta), such that a k (ipi, v ) = A XV’i, v) L i for 
all v € V*. Frogi the inverse estimate (3), there exists C > 0 such that 


A< < Chi 2 . 


( 6 ) 


The same results hold for the conforming finite element spaces. The norm |u|, (fc is defined (see [1]) 
as follows: 


' n k 


\i=l 


1/2 




where v = ^ i £ V* . 


•=1 


Moreover, 


( 7 ) 

( 8 ) 


IHo,*=IMU’ and |M||i,* = ||u||fc. 
g ; 

And, the Cauchy-Schwarz inequality implies 

[afc(v,t^)| < Ifluffli+t.fcHi-i.fc 

§r • - 

for any t G R and v, w G V k . 

For u e Vfc_i the intergrid transfer operator I [ : V*_i -> V k is defined as follows. Let p be a 
midpoint of a side of a triangle in T k . If p lies in the interior of a triangle in T k ~\ then we define 

(Jt.uXp) := v(p). 

Otherwise, if p Des on the common edge of two adjacent triangles T\ and T 2 in T k ~\ then we define 


(7jt-i v )(p) := oMt,(p) + u lr 2 (p)]- 


From the definition of I%_ lt it is clear that 

It-i v = v Vt> € W k .i = Fit D V*_i C ifo(fl) 
In other words, /jtLilw*., is just the natural injection. 

Now we are ready to state an approximation property. 


Lemma 1 Given ueV k let u* € W k - 1 be the solution of 

a k (u - u*, v) = 0 Vv 6 W fc _ i 

Then 

Iu-U*|l, fc <CM«|2, fc . 
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Proof. Let g € Vk satisfy 

(S,v) = a*(u, i>) VuGF*. 

Then 

Vv € Wfc_i, a*(u*,v) = a k (u,v) = (g,v). 

Now let w e be the solution of the Dirichlet problem 

—V • (aVw) + /3w = g in fi 

w = 0 on . 

Then by elliptic regularity Hlff’ < C\\g\\v. It follows from the discretization error estimates (4) 
and (5) that 

( 9 ) 

( 10 ) 


lit — U* L 3 ^ ||tt — || £,3 + \w — U ll^a 


But 

Therefore, 


< Ch 2 ||u>||tf J 

< ch*M». 

h = = a *KsO ^ HMIkfclklk 2 


MU* < IIMIk*- 

Combining inverse estimate (3) and (11), we obtain 

If u — tt*|||i,fc < ~^\\ u ~ u *lk j — n 


(ii) 


3. THE MULTIGRID ALGORITHM 


Now, we consider a decreasing sequence of mesh size h k : 

h(, > h\ > • • • > hk > • • • > hknit • 


We first describe the fc-level iteration scheme of the conforming multigrid algorithm. The fc-level 
iteration with initial guess z 0 yields CMG(k, z 0 , G) as a conforming approximate solution to t e 
following problem. 

Find zeW k such that a k (z, v) = G{v) Vu € W k , where G G W' k . 

Here, WL is the dual space of Wt. For k = 1, CMG(Mo,C) is the solution obtained from a direct 
method. For it > 1, CMG(k, z r „ G) = z m + /£_,?, where the approximation z m £ W t is constructed 
recursively from the initial guess zq and the equations 


1 


Here, A* is greater than or equal to the largest eigenvalue of A k which is the stiffness matrix of a* 
in the conforming finite element space W k , and m is an integer to be determined later. The 
coarser-grid correction q £ W k _ 1 is obtained by applying the ( k — l)-level iteration 1 time. In other 
words, it is the V-cycle multigrid method. More precisely, 

q = CMG(k- 1,0,6) 

where G € W’ k _ x is defined by G(v) := G(lg_ x v) - a k (z m ,tf_ lV ) for all v £ W k _ x . 

The nonconforming multigrid algorithm of this paper is as follows: The Jb^-level iteration with 
initial guess z 0 yields NC-CMG(k ma , x , z 0 , F) as a nonconforming approximate solution to the 
following problem. 


Find z £ such that 

” ) = F{v ) = f fv Vt> £ V km „. (12) 

J ii 

For kjnax = 1, NC‘OMG{\^ zq, F) is the solution obtained from a direct method. 

For k max > 1, 

Smoothing Step; the approximation z m E V k is constructed recursively from the initial guess zo 
and the equations 

Zi = z,-_ i + (F - A km „Zi-i), 1 < i < m . (13) 

Here, A* m „ is greater than or equal to the largest eigenvalue of A kmxx which is the stiffness 
matrix of a kmxx in the nonconforming finite element space 

Correction Step: The coarser-grid correction q £ W k . i is obtained by applying the {k m3 . x - l)-level 
conforming iteration 1 time. More precisely, 

q = CMG(k m&x - 1,0, F) 

where F £ W' kmxx _ x is defined by F(v) := F(tf_ x v) - a k (z m , l£_ lV ) for all v £ W k . x . 

Put 

NC-CMG(k mAX , z 0 , F) = z m + • 
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4. ESTIMATE OF CONVERGENCE RATE 


Now, we can proceed with the well-known analysis of the conforming multigrid method in [ 2 ]. 
Define the linear mapping J : V* — ► V* by 


Jw = Yj (l “ T^~) fa for w = ■ 

i \ ''max J j 


Here A;’s are the eigenvalues of a k . The smoothing step ( 13 ) amplifies the error e< = z — z, by J, i.e., 
ti = Je,_i . Note that J is a self adjoint and semidefinite operator with respect to the energy norm. 

Define the weaker seminorm 

M 2 f 1 “ Y 1 -) v] for w - 

i \ A max J i 

From ( 7 ) and (8) we know |[u>||| = £ A<i/? and |to| < ||u;||*. Define the ratio 

:= | HVIHI* if u> ^ o , 


0 


if to = 0 


It can be regarded as a measure for the smoothness of to E V k because for a smooth function the 
coefficient V{ for small A,-’s dominate and |to| « ||io||fc. 

Lemma 2 Given to € V* put p = p(J m w). Then 

\\J m w\\ k <p™\\w\\ k . 

Proof. Similar to the proof of Lemma 4 . 3 . in [2]. 0 

Let 9(6 W*_i) be the exact coarser-grid correction i.e. 

v) = F(v) - a k (z m , v) Vv 6 W k . 1 . 

Define 

Qe-m ' = 9 

Then Q is the a^-orthogonal projector from V k into W^. Note that q is a^-orthogonal projection 
of e m into W k _\. 

Lemma 3 Given w € V k we have 

IIIQHk* < min |l, C\Jl — Ihli.fe. 
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Proof. For w = £ i/rf <, we have 






i/.- 





It follows from Lemma 1 that ||Qu»|i ( * < C , h|fio§2,fc. This and the estimate (6) for A max imply 

IOlU < Ck 2 A m „(Iu.|!,»-|«.| 2 ) 

< c(l<<»-M 2 ) 

= c(i - 


Moreover, since Q is an orthogonal projector, we have 

IIQHIIi.fc — min jl, C\/l - p(u;)j |u;|||i ifc . D 


We are now (as in [10]) in a position to define three multigrid iterative schemes for the solution 
of (12). 

1. the symmetric scheme NC-CMGVk‘. symmetric smoothing NC-CMG scheme 

2. the coarse-to-fine cycle NC-CMG/ k- postsmoothing NC-CMG scheme 

3. the fine-to-coarse cycle: NC-CMG\k' our NC-CMG scheme. 


In particular, we have[10] 

\\NC-CMG/k\\ k = \\NC-CMG\k\\k, 

\\NC-CMGV k \\k = ||JVC-CMG\*|| 2 fc . 

The symmetrical method NC-CMGV enables us to use estimates with respect to the energy norm 
and to apply a duality argument. 


Lemma 4 The multigrid algorithm NC-CMGVk has a convergence factor 

\\NC-CMGVk\\k< max p 2m {e + (l — e)min(l,C[l -/>])}, (14) 

0<P<1 

with respect to the energy norm, e is the error in ( k — 1 )-level CMGVk-i and the constant C is 
independent of k and m. 
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We note that the right-hand side of (14) is a monotone function of e due to the cut-off induced 
by the min-operation which is contained in the expression. 

Proof. 

Zm+i = z m + q + ew' (i.e. \\q - g||* < e\\q\\ k ) 
with some w' € W k ~\. Hence the error is 

e m+ 1 = e m -q-ew' = Qe m - tw' . 


Since Qe m is orthogonal to W k ~\ and w' G W k - 1 , we get 

||Qe m — w'\\\ = IIQ^mllk + ll^llfc ^ ||Q e m||fc + ||?|U 

< iw«-iii+ii(r-o)«-iii= uvul- 


as) 

(16) 


In order to estimate the final error e 2 m+i = J m e m +i, we use a duality argument: 

||e2m+i|U = sup^a(it), e2m+i)/||^|U- Note that (16) , Q 2 = Q and Cauchy-Schwarz’s inequality 
imply 


ak(u>,e 2m+ i) 


= a k (w, J m (Qe m - tw')) 

= a k (J m w, (1 - e)Q 7 e m + e(Qe m - w ')) 

< (1 - e)a k (J m w, Q 2 e m ) + e|| 7 m t&||fc||e m ||* 

< (1 - €)\\QJ m w\\ k \\QJ m e 0 \\ k + e\\J m w\\ k \\J m e 0 \\ k 

< 1(1 - + «|| J“A||!] I/! [(1 - «)IIOJ n eo||J + -=l|J' m e 0 ||J| 1 / 2 . 


Given w €V k by the Lemmas 2 and 3 it follows that 

(1 - e)\\Q J m w\\\ + e||«7 m u;||! < p 2m {e + (1 - e) min(l, C[1 - p])}|MI* , 
where p — p(J m w). Hence 

||e 2 m+i||fc < maxp 2m {c + (1 - e)min(l,C[l - p))}||e 0 |U. □ 


Theorem 5 If ||CMG\fc_i||*;_i < where c ^ 2m — & < then 

\\NC-CMG\ k \\ k <6 1 ' 3 . 


Proof. We conclude from Lemma 4, 


\\NC-CMGV k \\ k = 


max 


0 <p<i 


p 2 "{« + (l-<)min(l.C(l-^))}, 


because |jCiWGV*-lllk-l = ||C'A/C?\fc_i||^_j < 6. Maximum 6 is attained at p = 1 when 

||JVC-CMG\*||* = \\NC-CMGV k \\]/ 2 < 6 l/i . □ 
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Table I: Number of Grid = 8 i.e. h = 1/8 



S-NCMG 

M-NCMG 

NC-CMG 

smoothing 

iter 

time(sec) 

iter 

time(sec) 

iter 

time(sec) 

1 

4 

.909 

3 

.788 

3 

.233 

2 

3 

.689 

2 

.523 

2 

.156 

3 

2 

.471 

2 

.540 

2 

.170 

4 

2 

.483 

2 

.549 

2 

.177 


Since the conforming multigrid method with the V-cycle and arbitrary smoothing step is convergent 
we can choose S such that 1 > 6 > — and ||CMG\k_i||fc_i < 

5. EXPERIMENTAL RESULTS 


We implement the standard nonconforming multigrid algorithm S-NCMG in [4], the modified 
standard nonconforming multigrid algorithm M-NCMG in [3] and NC-CMG with the V-cycle for 
the Laplace’s equation 


—A u = — 1 in ft = unit square, 
u = 0 on d(l . 

Let {^i, be the basis of V* such that each <f>* equals 1 at exactly one midpoint and equals 
0 at all other midpoints. The stiffness matrix representing <**(•, •) with respect to this basis of 
nonconforming space has at most five entries per row. In the conforming case, the stiffness matrix 
has again at most five entries per row. Therefore z m can be obtained from zq by iterating a sparse 
band matrix. We use the Gershgorin theorem in order to get the bounds of the maximum 
eigenvalues. These are the rough bounds so that the convergence rate is not optimal, but there is a 
trade-off because finding the exact maximum eigenvalue costs more. Note that the matrix for /£_] 
has again at most five entries per row. 

We take an initial guess zo = 0. The programs execute the multigrid iterations until the discrete 
energy norm of the real error is below the tolerance 1 / (number of basis) for various mesh size and 
the number of smoothing. The real solution comes from the SSOR preconditioning conjugate 
gradient method for the five point finite difference scheme in which the difference of two consecutive 
solutions is less than the tolerance 10 -9 in the descrete I 2 sense. The experiments reported here 
were run in double-precision arithmetic on CM-5 Vector Units which has 32K processors. 

There are many ways to measure the performance of a parallel algorithm running on a parallel 
processor(see [9]). The most important and commonly used metric is the elapsed cpu time to run a 
job on a given machine even though it depends on how to optimize the program. We used the 
power method to get the rate of convergence. In the Table V-VIII the rate of convergence of 
S-NCMG and M-NCMG is slightly smaller or larger than the rate of convergence of NC-CMG. 
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Table II: Number of Grid = 16 i.e. h = 1/16 



S-NCMG 

M-NCMG 

NC-CMG 

smoothing 

iter 

time(sec) 

iter 

time(sec) 

iter 

time(sec) 

1 

7 

2.604 

5 

2.089 

5 

.766 

2 

4 

1.526 

3 

1.187 

3 

.481 

3 

3 

1.183 

3 

1.247 

3 

.512 

4 

3 

1.212 

3 

1.240 

2 

.360 


Table III: Number of Grid = 32 i.e. h = 1/32 



S-NCMG 

M-NCMG 

NC-CMG 

smoothing 

iter 

time (sec) 

iter 

time(sec) 

iter 

time (sec) 

1 

10 

6.037 

7 

4.294 

7 

1.625 

2 

6 

3.723 

5 

3.163 

4 

.970 

3 

5 

3.196 

4 

2.573 

4 

1.034 

4 

4 

2.641 

3 

1.975 

3 

.832 


Table IV: Number of Grid = 64 i.e. h = 1/64 



S-NCMG 

M-NCMG 

NC-CMG 

smoothing 

iter 

time(sec) 

iter 

time(sec) 

iter 

time(sec) 

1 

14 

16.668 

10 

11.879 

9 

2.874 

2 

8 

9.560 

7 

8.396 

5 

1.692 

3 

6 

7.196 

5 

6.059 

4 

1.447 

4 

5 

6.200 

4 

4.987 

4 

1.544 


Table V: Number of Grid = 8 i.e. h = 1/8 



S-NCMG 

M-NCMG 

NC-CMG ; 

smoothing 

rate of conv. 

rate of conv. 

rate of conv. 

1 

.903 

.903 

.906 

2 

.815 

.815 

.820 

3 

.736 

.736 

.742 

4 

.665 

.665 

.672 





Table VI: Number of Grid = 16 i.e. h = 1/16 



S-NCMG 

M-NCMG 

NC-CMG 

smoothing 

rate of conv. 

rate of conv. 

rate of conv. 

1 

.904 

.904 

.910 

2 

.817 

.818 

.829 

3 

.739 

.739 

.754 

4 

.668 

.669 

.687 


Table VII: Number of Grid = 32 i.e. h = 1/32 



S-NCMG 

M-NCMG 

NC-CMG 

smoothing 

rate of conv. 

rate of conv. 

rate of conv. 

1 

.904 

.904 

.911 

2 

.818 

.818 

.830 

3 

.740 

.740 

.757 

4 

.669 

.669 

.689 


Table VIII: Number of Grid = 64 i.e. h = 1/64 



S-NCMG 

M-NCMG 

NC-CMG 

smoothing 

rate of conv. 

rate of conv. 

rate of conv. 

1 

.904 

.904 

.911 

2 

.939 

.818 

.830 

3 

.888 

.740 

.757 

4 

.773 

.669 

.690 







Figure 1: Nonconforming vs. conforming. 


In Figure 1, (A) and (B) represent the location of the nodal basis of nonconforming finite 
elements and conforming finite elements, respectively. Squares represent the basis in V k -i or W k - 1 
and circles represent the basis in V k or W k . In the correction step the centered black square is 
communicating with the black circles around it. Therefore S-NCMG and M-NCMG need further 
communications. Since the performance is determined mainly by the communication time in a 
massively parallel machine like CM-5, S-NCMG and M-NCMG require more cpu time than 
NC-CMG. It is shown in tables I-IV. Moreover NC-CMG does less computation and is easier to 
implement because the number of the basis of V k is approximately three times of that of W k and 

w k . ! c w k . 
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